Image capture for a multi-dimensional building model

ABSTRACT

A process for creating, responsive to a desired analysis, estimate information from received imagery by determining whether the imagery meets a minimum criterion for creating a multi-dimensional model that can address the desired analysis and building an aggregate set of images meeting the minimum criterion from which a multi-dimensional model is then built and an estimate responsive to the desired analysis is derived. The multi-dimensional model, and the responsive estimate, may be refined as additional images are provided to the aggregate set. Desired analyses may include dimensions such as square footage of a roof for building objects.

CROSS REFERENCE TO RELATED PATENTS/PATENT APPLICATIONS

The present U.S. Utility Patent Application claims priority pursuant to35 U.S.C. § 120, as a continuation of U.S. Utility patent applicationSer. No. 16/545,067, entitled, “IMAGE CAPTURE FOR A MULTI-DIMENSIONALBUILDING MODEL,” filed Aug. 20, 2019, which is a continuation of U.S.Utility patent application Ser. No. 15/942,786, entitled “IMAGE CAPTUREFOR A MULTI-DIMENSIONAL BUILDING MODEL,” filed Apr. 2, 2018, issued asU.S. Pat. No. 10,410,413 on Sep. 10 2019, which is acontinuation-in-part (CIP) of U.S. Utility patent application Ser. No.15/166,587, entitled “GRAPHICAL OVERLAY GUIDE FOR INTERFACE,” filed May27, 2016, issued as U.S. Pat. No. 9,934,608 on Apr. 3, 2018, whichclaims priority pursuant to 35 U.S.C. § 119(e) to U.S. ProvisionalApplication No. 62/168,460, entitled “GRAPHICAL OVERLAY GUIDE FORINTERFACE,” filed May 29, 2015, all of which are hereby incorporatedherein by reference in their entirety and made part of the present U.S.Utility Patent Application for all purposes.

BACKGROUND Technical Field

The technology described herein relates generally to a system and methodfor creating multi-dimensional building models (e.g., 3D), and inparticular, to a system and method for processing a series of imagecaptures to develop and refine building feature dimensions.

Description of Related Art

Some efforts have been made to generate three-dimensional (3D) texturedmodels of buildings via aerial imagery or specialized camera-equippedvehicles. However, these 3D models have limited texture resolution,geometry quality, accurate geo-referencing and are expensive, timeconsuming and difficult to update and provide no robust real-time imagedata analytics for various consumer and commercial use cases.

Disadvantages of conventional approaches will be evident to one skilledin the art when presented in the disclosure that follows.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates one embodiment of a system architecture in accordancewith the present disclosure;

FIG. 2 illustrates a flowchart representing the process for accuratelyguiding a user to capture images used to create a 3D building model inaccordance with the present disclosure;

FIG. 3 illustrates a diagram of geo-referencing a building location inaccordance with the present disclosure;

FIG. 4 illustrates a diagram of capturing ground-level images of abuilding object in accordance with the present disclosure;

FIG. 5 illustrates an example of collected ground-level images for afaçade of a building in accordance with the present disclosure;

FIGS. 6A-6F, collectively, illustrate a set of guide overlays inaccordance with the present disclosure;

FIG. 7 illustrates one embodiment of graphical set-up selections foraccurately initiating selection of building graphical overlays inaccordance with the present disclosure;

FIG. 8 illustrates an example series of building graphical overlays fora complex residential building in accordance with the presentdisclosure;

FIG. 9 illustrates a diagrammatic representation of a machine in theexample form of a computer system in accordance with the presentdisclosure; and

FIG. 10 illustrates processing a series of perspective building imageryusable in a multi-dimensional model construct.

DETAILED DESCRIPTION

FIG. 1 illustrates one embodiment of system architecture in accordancewith the present disclosure. In one embodiment, image processing system100 includes image processing servers 102. Images database (DB) 104 andimage processing servers 102 are coupled via a network channel 106.

Network channel 106 is a system for communication. Network channel 106includes, for example, an Ethernet or other wire-based network or awireless NIC (WNIC) or wireless adapter for communicating with awireless network, such as a WI-FI network. In other embodiments, networkchannel 106 includes any suitable network for any suitable communicationinterface. As an example, and not by way of limitation, the networkchannel 106 can include an ad hoc network, a personal area network(PAN), a local area network (LAN), a wide area network (WAN), ametropolitan area network (MAN), or one or more portions of the Internetor a combination of two or more of these. One or more portions of one ormore of these networks may be wired or wireless. As another example,network channel 106 can be a wireless PAN (WPAN) (such as, for example,a BLUETOOTH WPAN), a WI-FI network, a WI-MAX network, a 3G, 4G or 5Gnetwork, LTE, a cellular telephone network (such as, for example, aGlobal System for Mobile Communications (GSM) network).

In one embodiment, network channel 106 uses standard communicationstechnologies and/or protocols. Thus, network channel 106 can includelinks using technologies such as Ethernet, 802.11, worldwideinteroperability for microwave access (WiMAX), 3G, 4G, LTE, CDMA,digital subscriber line (DSL), etc. Similarly, the networking protocolsused on network channel 106 can include multiprotocol label switching(MPLS), the transmission control protocol/Internet protocol (TCP/IP),the User Datagram Protocol (UDP), the hypertext transport protocol(HTTP), the simple mail transfer protocol (SMTP), and the file transferprotocol (FTP). In one embodiment, the data exchanged over networkchannel 106 is represented using technologies and/or formats includingthe hypertext markup language (HTML) and the extensible markup language(XML). In addition, all or some of links can be encrypted usingconventional encryption technologies such as secure sockets layer (SSL),transport layer security (TLS), and Internet Protocol security (IPsec).

In one or more embodiments, image processing servers 102 includesuitable hardware/software in the form of circuitry, logic gates, and/orcode functions to process ground-level images to include, but notlimited to, calculation of one or more architectural featuremeasurements. Capture device(s) 108 is in communication with imageprocessing servers 102 for collecting images of building objects.Capture devices 108 are defined as electronic devices for capturingimages. For example, the capture devices include, but are not limitedto: a camera, a phone, a smartphone, a tablet, a video camera, asecurity camera, a closed-circuit television camera, a computer, alaptop, a webcam, wearable camera devices, photosensitive sensors, IRsensors, lasers, equivalents or any combination thereof.

Image processing system 100 also provides for viewer device 110 that isdefined as a display device. For example, viewer device 110 can be acomputer with a monitor, a laptop, a touch screen display, a LED array,a mobile phone, a smartphone, a tablet, a television set, a projectordisplay, a wearable heads-up display of some sort, or any combinationthereof. In one or more embodiments, the viewer device includes displayof one or more building façades and associated measurements, such as,for example, a conventional desktop personal computer having inputdevices such as a mouse, keyboard, joystick, or other such input devicesenabling the input of data and interaction with the displayed images andassociated measurements.

In one embodiment, an image processing system is provided for uploadingto image processing servers 102 ground-level images of a physicalbuilding from a capture device. An uploaded image is, for example, adigital photograph of a physical building, for example, showing a cornerwith one or more sides of the physical building. Image processingservers construct a 2D/3D building model using the uploaded groundimages with known or future 2D/3D model construction methodologies.

FIG. 2 illustrates a flowchart representing a process for guiding a userof a capture device 108 (e.g., smartphone) to more accurately capture aseries of ground level images of a building. Ground level images arecaptured as the picture taker moves around the building—taking aplurality (e.g., 4-16 for an entire building) of ground level imagesfrom multiple angles and distances. The series of captured ground levelimages will be uploaded to both image database 104 and image processingservers 102 to be stored and processed to create the 3D building modeland returned to the user.

Process 200 begins in step 202 with initiation of a capture device 108(e.g., smartphone with camera). Initiation may include one or more of:determining location and/or aspect 203 (i.e., perspective, such asaligned with right corner of building) as further described inassociation with FIG. 3 ; determining user information (name, login,account info, etc.); determining if the subject of the ground-levelimage is a residential or commercial building 203 manually from userselection or automatically from location determination/perspective(e.g., pointing in a specific direction at a determined location); orusing real-time image-based computer vision techniques. For example, theuser's location can be determined from GPS or user entry of an address.Using sensors of the capture device (e.g., pitch, yaw and roll axis), adetermination can be made as to which surrounding building is thesubject of the capture (i.e., facing a specific building on the southside of the street at a specific location). In one embodiment,image-based algorithms identify building features in real-time thatconfirm or reinforce expected building type assumptions. For example, ata known location, a two-story structure with gable roof is expected.

In step 204, a first overlay from a set of sequential graphical overlayguides is retrieved for display on capture device 108. For example, afirst overlay guide illustrating a 3D perspective graphic can be afront/right guide as shown in FIG. 6B. The front/right overlaid guideincludes a 3D perspective front image façade, right corner edge andpartial right-side façade of a building and, in one embodiment, isdisplayed as a semi-transparent graphic (i.e., can see through). Othergraphical overlays, including 2D, may be used without departing from thescope of the present disclosure. The graphical overlay guides are notused to assist in focusing the camera. In step 206, the system receivesa selection of which overlay guide best matches the present perspectiveof the capture device (manually from the user or automatically based onlocation/perspective/other image processing and/or computer visionalgorithms running in near-real time on the image capture device as isknown in the art). Optional selections 203, such as simple (one story)or complex structure (two-story, multiple sections, etc.), may assist ingenerating an appropriate set of specific overlay guides. For example,if the user is right of center facing a simple structure building, anexample front/right overlay guide would be best. While the system willautomatically determine perspective based on visual cues in the cameraimage (still or video sequence), if the user is not aligned with theappropriate guide of the building, they can sequence through the set ofoverlay guides until arriving at the appropriate overlay guide for theirposition/perspective. In an alternative embodiment, a corresponding 3Doverlay is automatically selected to match a capture device detectedorientation of the subject building. In an alternative embodiment, auser is prompted to start with a specific overlay (e.g., front).

In step 208, using the overlay guide as an alignment tool, the capturedevice camera image is substantially aligned (perfect alignment notrequired) with the overlay guide. Typically, the user would have ontheir display an image as seen, for example, from their smartphonecamera viewfinder and thereafter align the corner of the building in thedisplay with the corner in the overlay guide (see FIG. 6B). Verticalalignment is also possible using the upper/lower edges of side panes ofthe overlay guide. Once aligned, a picture is taken (captured) and theprocess recognizes (associates) that a captured image (photo) of thebuilding from the selected overlay guide is taken. The picture can betaken manually by the user or automatically taken when substantiallyaligned with the overlay guide. In addition, client-side feedback canassist the user, for example, visual indicators such as text prompts orcolor indicators (green (great), yellow (good), red (bad)); flashing orhighlighting, etc. or audio indicators (e.g., “move left,” “move right,”“up,” “down” and “center building”).

Visual indicators of image quality are included 216 to determine imagequality (Good, Bad or Best Available). Image quality can be determinedin real time (e.g., milliseconds) either using onboard software orremotely using server processing (102) and may include historical data(similar image comparison) for refinement or learning capability. The‘best available” option, in one embodiment, is selected manually by theuser based on location, distance restrictions, camera restrictions, etc.or automatically based on a number of failures to capture a good image.If the image capture is bad (e.g., left/right/top/bottom boundaries cutoff), the user is prompted to either retake the image or select bestavailable.

In step 210, the overlay guide is sequentially advanced (e.g., movingcounter-clockwise around the building) guiding the user to a nextposition to take another image of the building. The process continues instep 212 until the building is fully captured (e.g., four corners).While only one corner image of the building is required to minimallycapture building 3D information, the quality, accuracy and completenessof a 3D model of the building created from the images will improve witha greater number and better circumferential distribution (e.g., manysides, corners and perspectives).

In step 214, the captured images are uploaded to image processingservers 102 to create a 3D model. The technology described herein is notlimited by the method to produce the 3D building model. In one exampleembodiment, the images are stored locally in capture device memory forlocal processing, uploaded in real-time, discreetly or as a group, toimages DB 104 or to another computer/server memory for storage beforeprocessing in 102. In one example embodiment, the images are uploadedto/from third party image services (e.g., Flickr®, Facebook®, Twitter®,etc. or other mobile apps, such as Zillow®, Craigslist®, etc.) firstbefore being uploaded to image processing servers 102/images DB 104. Foranother example, the images are transferred first from a camera to anetworked computer (e.g., cloud-based server system), and then to imageprocessing servers 102/images DB 104. Even though the embodiments aredescribed with remote (server) processing, the entire process of thetechnology described herein can be performed on the capture deviceitself.

In one embodiment, key architectural geometric features of the 3D modelare identified. Architectural features include, but are not limited to:windows, doors, siding, gutters, roofing, roof intersection vertices,ground intersection vertices, bay window vertices, vents and/oroverhanging vertices. Scaled measurements may also, in one embodiment,be included for one or more of the architectural features. Inalternative embodiments, the identification of architectural geometricfeatures is performed semi-automatically, manually by a reviewer, orfully automatically. They may be performed in real-time or nearreal-time on the image capture device, partially on a server or entirelyon a server.

FIG. 3 illustrates a diagram of geo-referencing a building location inaccordance with the present disclosure (step 202). In one embodiment,capture devices include a global positioning satellite (GPS) system forproviding location information. The capture device 301 is used toidentify a location using the GPS information of a current location. Asshown in FIG. 3 , the capture device displays map 302 showing itscurrent location 303. In one embodiment, confirmation of the GPSdetermined location information is required. In alternative embodiments,the location of a building is determined by manually typing in anaddress or selecting a point on a map (e.g., 302). In the embodimentsdisclosed herein, the location of the building is not required toimplement the methods described for image capture and/or to create the3D building model.

FIG. 4 illustrates a diagram of capturing ground-level images of abuilding object in accordance with the present disclosure. As shown, aplurality of ground-level images for building object 401 are collectedby smartphone 402 (capture device 108). Each ground-level image is takenat a different perspective relative to the other images of the buildingobject. For example, smartphone 402 collects ground-level images at afirst position, a second position, a third position, a fourth positionand a fifth position (as shown 1, 2, 3, 4, 5). An overhead perspectivediagram 403 illustrates picture taking positions 1-6 as correlatingleft-to-right in the front of the building. The various embodiments ofthe present disclosure encourage a user to take a plurality of imageswith a corner and one or more sides visible at various perspectives(angles) circumferentially around the entire building. For example, animage captured from segment 6, would capture a perspective including afront façade 405, right corner 406 and right façade 407 perspective(FIG. 6B). An image captured from segment 7 may equate to FIG. 6C.

FIG. 5 illustrates an example of collected ground-level images forbuilding 401 in accordance with the present disclosure. Ground-levelimages 500 provide for different perspectives of the building (centeredon the façade). Ground-level images 501 to 505 capture the building fromfive perspectives panning, for example, left-to-right (counterclockwise) the façade of the building. Ground-level image 501 providesfor the left most building façade perspective relative to the otherground-level images. Ground-level image 505 provides for the right mostbuilding façade perspective relative to the other ground-level images.Ground-level images 502 through 504 represent the building façade fromthree additional perspectives between ground-level images 501 and 505.While shown as 5 images of the front of the building, the user continuesaround the building taking photos as suggested by the guide overlaysuntil the building is captured.

FIGS. 6A-6F, collectively, illustrate a set of guide overlays inaccordance with the present disclosure. For brevity purposes, theillustrations are limited to six overlays. However, a more complete setwould include a plurality of overlay guides to capture the entirebuilding including all sides/corner with various perspectives of thebuilding. For example, a plurality of ground-level images is collectedfor each visible side and/or corner of the building using the overlayguides. In an alternate example embodiment, when a building is part of alarger collection of buildings (i.e., a townhome, a store in a stripmall, etc.) not all sides of the building are accessible forground-level images as at least one side is shared with an adjacentbuilding and therefore many of the guides are skipped from thesequential set of graphical guides.

As shown, smartphone 602 includes a display section 604. When a cameraof the smartphone is activated for taking a photo, the digitalviewfinder shows the subject of the picture (in this case a building ofinterest 606) in display 604. Overlay guides 610 a-610 f are sequentialcounter-clockwise perspectives of the building. The user simply alignsthe overlay building guide with the subject building in the display andtakes the picture. As previously mentioned, substantially aligning acorner in the subject image displayed with a corner of the overlay guideis typically a good starting point for alignment (but not required). Inaddition, ensuring that the entire building object is roughly alignedwith the orientation of the overlay guide improves quality of capture.As illustrated, the sides are named “Front, Right, Left, or Back” 608 or“F, R, Lor B” 606 when perspective space does not allow for the completewording. Sequencing though the overlay guides prompts a user tosequentially move around the building using, for example, theirsmartphone in a structured manner to better capture a plurality ofimages capturing the entire building.

In one embodiment, pixels of the orthogonal image are geo-referencedwith accurate spatial representation in the world coordinate system.Geo-referenced orthogonal images therefore include distances betweencoordinate points. By correlating the ground-level building façade withknown geo-referenced orthogonal images, the geo-spatial position of eachpixel within the ground-level building façade is determined and can beused to accurately measure various architectural features of the façade.The described embodiments are, for simplicity, described for only asingle exterior façade, but can be repeated for any number of façades(sides), including the roof (top) or interior façades/surfaces and arenot limited to buildings. However, while described as an embodiment, thetechnology disclosed herein can be equally performed without orthogonalimagery, aerial and/or satellite imagery to capture the ground levelimages using the overlay guides and subsequent 3D construction of the 3Dbuilding model.

In one embodiment, known standard measurements, for example a heightfrom a door threshold to the center of a door knob is used to scale the3D building model. The scaling can be used with any known standardmeasurement or ratio (e.g., known standard width-to-height ratios ofstandardized building architectural features (e.g., doors, windows,etc.) located within a captured image.

FIG. 7 illustrates one embodiment of assisting the user in setting up agraphical building overlay selection. In one embodiment, graphicalset-up guides include determining user information (name, login, accountinfo, etc.) and/or determining if the subject of the ground level imageis a residential or commercial building (manually from user selection701 or automatically from location determination and perspective). Inaddition, selections 702, such as simple (one story) or complexstructure (two-story, multiple sections, etc.), assist in generating anappropriate set of specific building overlay guides.

FIG. 8 illustrates one embodiment of assisting a user in taking a seriesof perspective pictures (ground-level images) of a residential building.FIGS. 801-805 , collectively, illustrate a set of semi-transparentbuilding graphical overlays in accordance with the present disclosure.For brevity purposes, the illustrations are limited to five overlays.However, a more complete set would include a plurality overlay guides tocapture the entire building including all sides/corners with variousperspectives of the building. Based on a selection of residential andtwo stories, a semi-transparent complex two story 2D/3D building overlayis projected on the user's smartphone (capture device 108). As shown inelement 801, a front image as seen through the capture device viewfinder(e.g., screen) is displayed with a front semi-transparent buildingoverlay guide. The user would substantially align (e.g., center andlevel) the front building image with the corresponding overlay by movingthe capture device and take the picture when substantially aligned.Additional sequential overlays are illustrated, such as Front Right 802,Right Back 803, Back 804 and Back Left 805. As illustrated, thesemi-transparent complex two story 2D/3D building overlay each include aperspective to match a perspective of the building in the imageviewfinder (e.g., display screen). In an alternative embodiment, usingreal-time tracking and computer imaging, the perspective of the buildingoverlay can be automatically changed to fit the actual buildingperspective.

Referring now to FIG. 9 , therein is shown a diagrammatic representationof a machine in the example form of a computer system 900 within which aset of instructions, for causing the machine to perform any one or moreof the methodologies or modules discussed herein, may be executed.Computer system 900 includes a processor, memory, non-volatile memory,and an interface device. Various common components (e.g., cache memory)are omitted for illustrative simplicity. The computer system 900 isintended to illustrate a hardware device on which any of the componentsdepicted in the example of FIG. 1 (and any other components described inthis specification) can be implemented. The computer system 900 can beof any applicable known or convenient type. The components of thecomputer system 900 can be coupled together via a bus or through someother known or convenient device.

This disclosure contemplates the computer system 900 taking any suitablephysical form. As example and not by way of limitation, computer system900 may be an embedded computer system, a system-on-chip (SOC), asingle-board computer system (SBC) (such as, for example, acomputer-on-module (COM) or system-on-module (SOM)), a desktop computersystem, a laptop or notebook computer system, an interactive kiosk, amainframe, a mesh of computer systems, a mobile telephone, a personaldigital assistant (PDA), a server, or a combination of two or more ofthese. Where appropriate, computer system 900 may include one or morecomputer systems 900; be unitary or distributed; span multiplelocations; span multiple machines; or reside in a cloud, which mayinclude one or more cloud components in one or more networks. Whereappropriate, one or more computer systems 900 may perform withoutsubstantial spatial or temporal limitation one or more steps of one ormore methods described or illustrated herein. As an example, and not byway of limitation, one or more computer systems 900 may perform in realtime or in batch mode one or more steps of one or more methods describedor illustrated herein. One or more computer systems 900 may perform atvarious times or at different locations one or more steps of one or moremethods described or illustrated herein, where appropriate.

The processor may be, for example, a conventional microprocessor such asan Intel Pentium microprocessor or Motorola power PC microprocessor. Oneof skill in the relevant art will recognize that the terms“machine-readable (storage) medium” or “computer-readable (storage)medium” include any type of device that is accessible by the processor.

The memory is coupled to the processor by, for example, a bus. Thememory can include, by way of example but not limitation, random accessmemory (RAM), such as dynamic RAM (DRAM) and static RAM (SRAM). Thememory can be local, remote, or distributed.

The bus also couples the processor to the non-volatile memory and driveunit. The non-volatile memory is often a magnetic floppy or hard disk, amagnetic-optical disk, an optical disk, a read-only memory (ROM), suchas a CD-ROM, EPROM, or EEPROM, a magnetic or optical card, or anotherform of storage for large amounts of data. Some of this data is oftenwritten, by a direct memory access process, into memory during executionof software in the computer 900. The non-volatile storage can be local,remote, or distributed. The non-volatile memory is optional becausesystems can be created with all applicable data available in memory. Atypical computer system will usually include at least a processor,memory, and a device (e.g., a bus) coupling the memory to the processor.

Software is typically stored in the non-volatile memory and/or the driveunit. Indeed, for large programs, it may not even be possible to storethe entire program in the memory. Nevertheless, it should be understoodthat for software to run, if necessary, it is moved to a computerreadable location appropriate for processing, and for illustrativepurposes, that location is referred to as the memory in this paper. Evenwhen software is moved to the memory for execution, the processor willtypically make use of hardware registers to store values associated withthe software, and local cache that, ideally, serves to speed upexecution. As used herein, a software program is assumed to be stored atany known or convenient location (from non-volatile storage to hardwareregisters) when the software program is referred to as “implemented in acomputer-readable medium.” A processor is considered to be “configuredto execute a program” when at least one value associated with theprogram is stored in a register readable by the processor.

The bus also couples the processor to the network interface device. Theinterface can include one or more of a modem or network interface. Itwill be appreciated that a modem or network interface can be consideredto be part of the computer system 900. The interface can include ananalog modem, ISDN modem, cable modem, token ring interface, satellitetransmission interface (e.g., “direct PC”), or other interfaces forcoupling a computer system to other computer systems. The interface caninclude one or more input and/or output devices. The I/O devices caninclude, by way of example but not limitation, a keyboard, a mouse orother pointing device, disk drives, printers, a scanner, and other inputand/or output devices, including a display device. The display devicecan include, by way of example but not limitation, a cathode ray tube(CRT), liquid crystal display (LCD), or some other applicable known orconvenient display device. For simplicity, it is assumed thatcontrollers of any devices not depicted reside in the interface.

In operation, the computer system 900 can be controlled by operatingsystem software that includes a file management system, such as a diskoperating system. One example of operating system software withassociated file management system software is the family of operatingsystems known as Windows® from Microsoft Corporation of Redmond, Wash.,and their associated file management systems. Another example ofoperating system software with its associated file management systemsoftware is the Linux™ operating system and its associated filemanagement system. The file management system is typically stored in thenon-volatile memory and/or drive unit and causes the processor toexecute the various acts required by the operating system to input andoutput data and to store data in the memory, including storing files onthe non-volatile memory and/or drive unit.

FIG. 10 illustrates one embodiment 1000 of processing a series ofperspective pictures (ground-level images), for example, of aresidential building, usable in a multi-dimensional model construct. Auser of the present application technology may be capturing images of abuilding but may not want to wait for a complete rendering of a 3D modelwith complete architectural feature dimensions and/or a completematerials replacement report. For example, the user may only desire abasic estimate of building roofing/flooring area in order to get a quickestimate for materials replacement/repairs. In the various embodimentsdescribed in association with FIG. 10 , a user can request varyingquality levels of dimensional estimation (preliminary to final) for oneor more desired architectural features (e.g., roof, doors, paintablesurfaces, window siding, etc.). In addition, as additional images arecaptured of the subject building (e.g., various angles, distances,positions), the quality of dimensions improves as these additionalimages are processed.

For example, a user may receive a real-time estimate of basic squarefootage of a building after capturing only a single image of a firstcorner and associated two sides (processed as a rectified box withwidth, length and height reflective of the two sides). As the user movesaround the building capturing more images, various bump-outs, openings,variances in walls or roof sections, specific architectural features(windows, doors, paintable surfaces, siding areas, etc.) will becaptured enabling a higher quality estimate of dimensions of thesespecific architectural features.

In step 1002, a series of captured building images are received by acomputing device. Typically, these captured images reflect cameraimagery taken at various angles and distances from a building object. Instep 1004, each building image in the series of captured building imagesis processed in real-time to determine if each building image meets aminimum criterion, where the minimum criterion includes applicabilityfor constructing a multi-dimensional building model. A minimum criterioncan be adjusted by a requestor. For example, a user only needing a roughestimate of floor square footage could define a minimum criterion thatminimally captures an outline of the building (e.g., a corner and twoassociated sides).

In step 1006, each of the building images meeting the minimum criterionis aggregated (e.g., stored as a designated building file in computermemory). In step 1008, it is determined when a base set of buildingimages has been aggregated (ranging from a single image of one cornerand associated two sides to imagery capturing all corners and sides ofthe subject building). A base set of building images includes athreshold number images to model at least a partial multi-dimensionalbuilding model representing the series of captured building images. Forexample, a very rudimentary base model (basic box) could be built usinga single image of a corner and both associated sides. While this partialmodel (i.e., not including all façade features) is incomplete, it can beutilized to calculate an estimate of basic sizing and area calculations.

Once a base set of building images has been aggregated, in step 1010,one or more façades present in the base set of building images areidentified. A façade can be any surface such as sides or top (roof). Instep 1012, one or more architectural features and/or openings of the oneor more façades are identified (e.g., walls, floor area, paintablesurfaces, siding, doors and/or windows, roofing areas, vents, garages,etc.). The one or more architectural features and/or openings areidentified using a database of previously stored building images 1014and operatively connected machine learning systems/algorithms 1016 usingimage processing techniques known in the art. For example, machinelearning techniques can include, but are not limited to, trained neuralnetworks, deep learning modules, artificial intelligence modules, ormachine learning modules.

In step 1018, preliminary dimensions (basic estimate of: square footage,total paintable surface area, total siding materials, door and/or windowdimensions, roofing materials, vents, etc.) for the one or morearchitectural features of the one or more façades and/or openings aredetermined using known imagery scaling and dimensional analysis systems.In step 1018, incrementally, the preliminary dimensions are optionallyreturned to the computing device.

The system continues to process received imagery related to the subjectbuilding using the above steps (e.g., 1010-1018) until it meets thedimensional quality and architectural feature dimensional requirementsof the requester or until all images have been received. In oneembodiment, the preliminary dimensions are returned to the computingdevice in real-time as they are computed with a complete (final) set ofpreliminary dimensions for one or more architectural features (e.g.,doors and windows) returned after all images have been processed.

As new imagery (e.g., additional angles, sides, corners, better qualityimages, etc.) becomes available (are captured and received by imageprocessing system), the quality of the preliminary dimensions(estimates) is refined (e.g., capturing various bump-outs, openings,secondary structures (e.g., garage) until, in step 1020, the dimensionsare finalized (highest accuracy) and optionally returned to thecomputing device in real-time with or without a dimensional report(e.g., door sizes). For example, preliminary dimensions are continuouslyreturned to the computing device until at least one complete set offinal dimensions for the one or more architectural features is returned.Continuously can include, but is not limited to, whenever newpreliminary/final estimates are available, when preliminary/finalestimates for a specific feature are available, or when a complete setof final dimensions is available for one or more architectural features.

In step 1022, a multi-dimensional model (e.g., 3D) is constructed andoptionally returned to the computing device (or third party) with one ormore requested dimensions (and optionally material reports) for one ormore architectural features.

In one embodiment, receiving a series of captured building imagesincludes receiving a request to return one of: the preliminarydimensions (i.e., rough estimate), continuously refined dimensions,final dimensions, a completed multi-dimensional building model, acompleted multi-dimensional building model with the preliminarydimensions, a completed multi-dimensional building model with finaldimensions or a completed multi-dimensional building model with amaterials report (e.g., various door and window sizes, types, costs,etc.).

The technology as described herein may have also been described, atleast in part, in terms of one or more embodiments. An embodiment of thetechnology as described herein is used herein to illustrate an aspectthereof, a feature thereof, a concept thereof, and/or an examplethereof. A physical embodiment of an apparatus, an article ofmanufacture, a machine, and/or of a process that embodies the technologydescribed herein may include one or more of the aspects, features,concepts, examples, etc. described with reference to one or more of theembodiments discussed herein. Further, from figure to figure, theembodiments may incorporate the same or similarly named functions,steps, modules, etc. that may use the same or different referencenumbers and, as such, the functions, steps, modules, etc. may be thesame or similar functions, steps, modules, etc. or different ones.

While particular combinations of various functions and features of thetechnology as described herein have been expressly described herein,other combinations of these features and functions are likewisepossible. For example, the steps may be completed in varied sequences tocomplete the building image captures. The technology as described hereinis not limited by the particular examples disclosed herein and expresslyincorporates these other combinations. It is noted that terminologies asmay be used herein have been used interchangeably to describe digitalinformation whose content corresponds to any of a number of desiredtypes (e.g., image, video, text, etc. any of which may generally bereferred to as ‘data’).

As may be used herein, the terms “substantially” and “approximately”provides an industry-accepted tolerance for its corresponding termand/or relativity between items. Such an industry-accepted toleranceranges from less than one percent to fifty percent. Such relativitybetween items ranges from a difference of a few percent to magnitudedifferences. As may also be used herein, the term(s) “configured to,”“operably coupled to,” “coupled to,” and/or “coupling” includes directcoupling between items and/or indirect coupling between items via anintervening item (e.g., an item includes, but is not limited to, acomponent, an element, a circuit, and/or a module). As may further beused herein, inferred coupling (i.e., where one element is coupled toanother element by inference) includes direct and indirect couplingbetween two items in the same manner as “coupled to.” As may evenfurther be used herein, the term “configured to,” “operable to,”“coupled to,” “operably coupled to” indicates that an item includes oneor more of power connections, input(s), output(s), etc., to perform,when activated, one or more its corresponding functions and may furtherinclude inferred coupling to one or more other items. As may stillfurther be used herein, the term “associated with,” includes directand/or indirect coupling of separate items and/or one item beingembedded within another item.

As may be used herein, the term “compares favorably,” indicates that acomparison between two or more items, signals, etc., provides a desiredrelationship. For example, when the desired relationship is that signal1 has a greater magnitude than signal 2, a favorable comparison may beachieved when the magnitude of signal 1 is greater than that of signal 2or when the magnitude of signal 2 is less than that of signal 1. As maybe used herein, the term “compares unfavorably,” indicates that acomparison between two or more items, signals, etc., fails to providethe desired relationship.

As may also be used herein, the terms “processing module,” “processingcircuit,” “processor,” and/or “processing unit” may be a singleprocessing device or a plurality of processing devices. Such aprocessing device may be a microprocessor, micro-controller, digitalsignal processor, microcomputer, central processing unit, fieldprogrammable gate array, programmable logic device, state machine, logiccircuitry, analog circuitry, digital circuitry, and/or any device thatmanipulates signals (analog and/or digital) based on hard coding of thecircuitry and/or operational instructions. The processing module,module, processing circuit, and/or processing unit may be, or furtherinclude, memory and/or an integrated memory element, which may be asingle memory device, a plurality of memory devices, and/or embeddedcircuitry of another processing module, module, processing circuit,and/or processing unit. Such a memory device may be a read-only memory,random access memory, volatile memory, non-volatile memory, staticmemory, dynamic memory, flash memory, cache memory, and/or any devicethat stores digital information. Note that if the processing module,module, processing circuit, and/or processing unit includes more thanone processing device, the processing devices may be centrally located(e.g., directly coupled together via a wired and/or wireless busstructure) or may be distributedly located (e.g., cloud computing viaindirect coupling via a local area network and/or a wide area network).Further note that if the processing module, module, processing circuit,and/or processing unit implements one or more of its functions via astate machine, analog circuitry, digital circuitry, and/or logiccircuitry, the memory and/or memory element storing the correspondingoperational instructions may be embedded within, or external to, thecircuitry comprising the state machine, analog circuitry, digitalcircuitry, and/or logic circuitry. Still further note that, the memoryelement may store, and the processing module, module, processingcircuit, and/or processing unit executes, hard coded and/or operationalinstructions corresponding to at least some of the steps and/orfunctions illustrated in one or more of the Figures. Such a memorydevice or memory element can be included in an article of manufacture.

One or more embodiments have been described above with the aid of methodsteps illustrating the performance of specified functions andrelationships thereof. The boundaries and sequence of these functionalbuilding blocks and method steps have been arbitrarily defined hereinfor convenience of description. Alternate boundaries and sequences canbe defined so long as the specified functions and relationships areappropriately performed. Any such alternate boundaries or sequences arethus within the scope and spirit of the claims. Further, the boundariesof these functional building blocks have been arbitrarily defined forconvenience of description. Alternate boundaries could be defined aslong as the certain significant functions are appropriately performed.Similarly, flow diagram blocks may also have been arbitrarily definedherein to illustrate certain significant functionality.

To the extent used, the flow diagram block boundaries and sequence couldhave been defined otherwise and still perform the certain significantfunctionality. Such alternate definitions of both functional buildingblocks and flow diagram blocks and sequences are thus within the scopeand spirit of the claims. One of average skill in the art will alsorecognize that the functional building blocks, and other illustrativeblocks, modules and components herein, can be implemented as illustratedor by discrete components, application specific integrated circuits,processors executing appropriate software and the like or anycombination thereof.

In addition, a flow diagram may include a “start” and/or “continue”indication. The “start” and “continue” indications reflect that thesteps presented can optionally be incorporated in or otherwise used inconjunction with other routines. In this context, “start” indicates thebeginning of the first step presented and may be preceded by otheractivities not specifically shown. Further, the “continue” indicationreflects that the steps presented may be performed multiple times and/ormay be succeeded by other activities not specifically shown. Further,while a flow diagram indicates a particular ordering of steps, otherorderings are likewise possible provided that the principles ofcausality are maintained.

The one or more embodiments are used herein to illustrate one or moreaspects, one or more features, one or more concepts, and/or one or moreexamples. A physical embodiment of an apparatus, an article ofmanufacture, a machine, and/or of a process may include one or more ofthe aspects, features, concepts, examples, etc. described with referenceto one or more of the embodiments discussed herein. Further, from figureto figure, the embodiments may incorporate the same or similarly namedfunctions, steps, modules, etc. that may use the same or differentreference numbers and, as such, the functions, steps, modules, etc. maybe the same or similar functions, steps, modules, etc. or differentones.

Unless specifically stated to the contra, signals to, from, and/orbetween elements in a figure of any of the figures presented herein maybe analog or digital, continuous time or discrete time, and single-endedor differential. For instance, if a signal path is shown as asingle-ended path, it also represents a differential signal path.Similarly, if a signal path is shown as a differential path, it alsorepresents a single-ended signal path. While one or more particulararchitectures are described herein, other architectures can likewise beimplemented that use one or more data buses not expressly shown, directconnectivity between elements, and/or indirect coupling between otherelements as recognized by one of average skill in the art.

The term “module” is used in the description of one or more of theembodiments. A module implements one or more functions via a device suchas a processor or other processing device or other hardware that mayinclude or operate in association with a memory that stores operationalinstructions. A module may operate independently and/or in conjunctionwith software and/or firmware. As also used herein, a module may containone or more sub-modules, each of which may be one or more modules.

As may further be used herein, a computer readable memory includes oneor more memory elements. A memory element may be a separate memorydevice, multiple memory devices, or a set of memory locations within amemory device. Such a memory device may be a read-only memory, randomaccess memory, volatile memory, non-volatile memory, static memory,dynamic memory, flash memory, cache memory, and/or any device thatstores digital information. The memory device may be in a form asolid-state memory, a hard drive memory, cloud memory, thumb drive,server memory, computing device memory, and/or other physical medium forstoring digital information.

What is claimed is:
 1. A method of processing imagery comprising:receiving a plurality of input camera images comprising imagery of abuilding object; receiving a desired estimate analysis related to thebuilding object within the imagery, wherein the desired estimateanalysis comprises a requested dimension of the building object;determining, for each of the received input cameras images, whether aminimum criterion is met, wherein the minimum criterion indicatesapplicability for constructing a multi-dimensional building model basedon the desired estimate analysis; aggregating an image set comprisingeach of the input camera images meeting the minimum criterion;constructing a base multi-dimensional building model of the buildingobject from the aggregated image set; and returning a responsiveestimate based on the constructed base multi-dimensional building model,wherein the responsive estimate comprises the requested dimension of thebuilding object.
 2. The method of claim 1, wherein the requesteddimension is a square footage estimate of a surface of the buildingobject.
 3. The method of claim 2, wherein the square footage estimate ofthe surface of the building object further includes a replacementmaterials report comprising replacement costs of a replacement materialbased on the square footage estimate.
 4. The method of claim 2, whereinthe surface of the building object is a roof surface.
 5. The method ofclaim 1, wherein the minimum criterion comprises detecting at least onespecific feature.
 6. The method of claim 5, wherein the detecting the atleast one specific feature comprises detecting a corner of the buildingobject.
 7. The method of claim 1, wherein the base multi-dimensionalmodel comprises a multi-dimensional building model constructed from athreshold number of images from the aggregated image set to derive theresponsive estimate.
 8. The method of claim 1, wherein the responsiveestimate comprises a calculated dimension from at least one measurementof the base multi-dimensional building model.
 9. The method of claim 1,wherein constructing the base multi-dimensional model further comprisesapplying a scale to the base multi-dimensional model.
 10. The method ofclaim 9, wherein the responsive estimate comprises a calculateddimension based on at least one scaled measurement of the basemulti-dimensional building model.
 11. The method of claim 9, wherein thescale is based on geo-referenced spatial information.
 12. The method ofclaim 9, wherein the scale is based on architectural features of thebase multi-dimensional model.
 13. The method of claim 1, furthercomprising updating the base multi-dimensional building model withadditional ones of input cameras images exceeding a threshold number ofimages from the aggregated image set to derive the responsive estimate.14. The method of claim 13, further comprising calculating a refinedresponsive estimate based on at least one measurement from the updatedbase multi-dimensional building model.
 15. A non-transitory computerreadable medium storing instructions that, upon execution, cause acomputing system to perform operations including: receiving a pluralityof input camera images comprising imagery of a building object;receiving a desired estimate analysis related to the building objectwithin the imagery, wherein the desired estimate analysis comprises arequested dimension of the building object; determining, for each of thereceived input cameras images whether a minimum criterion is met,wherein the minimum criterion indicates applicability for constructing amulti-dimensional building model based on the desired estimate analysis;aggregating an image set comprising each of the input camera imagesmeeting the minimum criterion; constructing a base multi-dimensionalbuilding model of the building object from the aggregated image set; andreturning a responsive estimate based on the constructed basemulti-dimensional building model, wherein the responsive estimatecomprises the requested dimension of the building object.
 16. Thenon-transitory computer readable medium of claim 15, wherein therequested dimension is a square footage estimate of a surface of thebuilding object.
 17. The non-transitory computer readable medium ofclaim 16, wherein the square footage estimate of the surface of thebuilding object further includes a replacement materials reportcomprising replacement costs of a replacement material based on thesquare footage estimate.
 18. The non-transitory computer readable mediumof claim 16, wherein the surface of the building object is a roofsurface.
 19. The non-transitory computer readable medium of claim 15,wherein the minimum criterion comprises detecting at least one specificfeature.
 20. The non-transitory computer readable medium of claim 19,wherein the detecting the at least one specific feature comprisesdetecting a corner of the building object.
 21. The non-transitorycomputer readable medium of claim 15, wherein the base multidimensionalmodel comprises a multi-dimensional building model constructed from athreshold number of images from the aggregated image set to derive theresponsive estimate.
 22. The non-transitory computer readable medium ofclaim 15, wherein the responsive estimate comprises a calculateddimension from at least one measurement of the base multi-dimensionalbuilding model.
 23. The non-transitory computer readable medium of claim15, wherein constructing the base multi-dimensional model furthercomprises applying a scale to the base multi-dimensional model.
 24. Thenon-transitory computer readable medium of claim 23, wherein theresponsive estimate comprises a calculated dimension based on at leastone scaled measurement of the base multidimensional building model. 25.The non-transitory computer readable medium of claim 23, wherein thescale is based on geo-referenced spatial information.
 26. Thenon-transitory computer readable medium of claim 23, wherein the scaleis based on architectural features of the base multi-dimensional model.27. The non-transitory computer readable medium of claim 15, furthercomprising updating the base multi-dimensional building model withadditional ones of input cameras images exceeding a threshold number ofimages from the aggregated image set to derive the responsive estimate.28. The non-transitory computer readable medium of claim 27, furthercomprising calculating a refined responsive estimate based on at leastone measurement from the updated base multi-dimensional building model.